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Abstract. For each positive-integer valued arithmetic function /, let ~ff C N denote the 
image of /, and put 'ff(x) := "ff fl [l,x] and Vf(x) := #Yf(x). Recently Ford, Luca, and 
Pomerance showed that n % is infinite, where </> denotes Euler's totient function and a 
is the usual sum-of-divisors function. Work of Ford shows that V$(x) >c (a;) as x — > oo. 
Here we prove a result complementary to that of Ford et al., by showing that most Rvalues 
are not cr-values, and vice versa. More precisely, we prove that as x — > oo, 



o 



> 
o 



1. Introduction 



J ■ 1.1. Summary of results. For each positive-integer valued arithmetic function /, let Yf C 

N denote the image of /, and put Yf(x) := Yf fl [l,x] and Vf(x) := #Yf(x). In this paper 
we are primarily concerned with the cases when f = <f), the Euler totient function, and when 
/ = a, the usual sum-of-divisors function. When / = <f), the study of the counting function 
Vf goes back to Pillai [Pil29], and was subsequently taken up by Erdos [Erd35, Erd45], 
Erdos and Hall [EH73, EH76], Pomerance [Pom86], Maier and Pomerance [MP88], and Ford 
[For98a] (with an announcement in [For98b]). From the sequence of results obtained by these 
authors, we mention Erdos's asymptotic formula (from [Erd35]) for log Vf ^ , namely 

g: (i-i) y >M = (i^w (*->») 

and the much more intricate determination of the precise order of magnitude by Ford, 
^ . (1.2) V f (x) x -^-exp(c7(log 3 x - log 4 x) 2 + D\og 3 x - (D + 1/2 - 2C) log 4 x). 

Here log fc denotes the kin. iterate of the natural logarithm, and the constants C and D are 
defined as follows: Let 

oo 

(1.3) F{z) := a n z n , where a n — (n + 1) log(n + 1) — nlogn — 1. 

n=l 

Since each a n > and a n ~ logra as n — > oo, it follows that F(z) converges to a continuous, 
strictly increasing function on (0, 1), and F(z) — > oo as z t 1. Thus, there is a unique real 
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N 


V>(N) 


VAN) 






V^(N)/V a (N) 


10000 


2374 


2503 


1368 


0.5762426 


0.5465441 


100000 


20254 


21399 


11116 


0.5488299 


0.5194635 


1000000 


180184 


189511 


95145 


0.5280436 


0.5020553 


10000000 


1634372 


1717659 


841541 


0.5149017 


0.4899348 


100000000 


15037909 


15784779 


7570480 


0.5034264 


0.4796063 


1000000000 


139847903 


146622886 


69091721 


0.4940490 


0.4712206 


Table 1. 


Data on 0-values, cr-va 


ues, and common values up to N = 10 fc , 


from k = 5 to k = 9. 











number q for which 

(1.4) F(q) = \ (£ = 0.542598586098471021959...). 

In addition, F' is strictly increasing, and F'(g) = 5.697758.... Then C = ^ J gg | = 
0.817814... and D = 2C(1 + logF'(g) - log(2C)) - 3/2 = 2.176968.... In [For98a], it 
is also shown that (1.2) holds for a wide class of 0-like functions, including f — a. Conse- 
quently, V^(x) x V a (x). 

Erdos (see [Erd59, 8, p. 172] or [EG80]) asked if it could be proved that infinitely many 
natural numbers appear in both ^ and This question was recently answered by Ford, 
Luca, and Pomerance [FLP10]. Writing V^ j<T (x) for the number of common elements of 
and ^ up to i, they proved that 

V<t>A x ) ^ exp((loglogx) c ) 

for some positive constant c > and all large x (in [GarlO] this is shown for all constants 
c > 0). This lower bound is probably very far from the truth. For example, if p and p + 2 
form a twin prime pair, then 4>{p + 2) = p+1 = <r(p); a quantitative form of the twin prime 
conjecture then implies that V^^x) ^> x/(\ogx) 2 . In Part I, we showed that a stronger 
conjecture of the same type allows for an improvement. Roughly, our result is as follows: 

Theorem A. Assume a strong uniform version of Dickson's prime k-tuples conjecture. Then 
as x —>• oo ; 

V<fA x ) = 



(\ogx) 1 +°W 

Theorem A suggests that V^ t(T (x) is much larger than we might naively expect. This 
naturally leads one to inquire about what can be proved in the opposite direction; e.g., 
could it be that a positive proportion of 0- values are also a-values? The numerical data up 
to 10 9 , exhibited in Table 1.1, suggests that the proportion of common values is decreasing, 
but the observed rate of decrease is rather slow. Our principal result is the following estimate, 
which implies in particular that almost all 0-values are not cr-values, and vice versa. 

Theorem 1.1. As x — > oo, 

V^x) + V a (x) 



(1.5) V^(x) ^ 



(log \ogxy/ 2 +°w 



The proof of Theorem 1.1 relies on the detailed structure theory of totients as developed 
in [For98a]. It would be interesting to know the true rate of decay of V^ !er (x)/V < j > (x). 
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1.2. Sketch. Since the proof of Theorem 1.1 is rather intricate and involves a number of 
technical estimates, we present a brief outline of the argument in this section. 

We start by discarding a sparse set of undesirable and a-values. More precisely, we 
identify (in Lemma 3.2) convenient sets and srf c with the property that almost all 0- 
values ^ x have all their preimages in ^ and almost all a- values ^ x have all their preimages 
in srf a . This reduces us to studying how many and a- values arise as solutions to the equation 

0(a) = a(a'), where a G stf^, a' G srf a . 

Note that to show that V^ a {x)jV^{x) — > 0, we need only count the number of common 0-<r- 
values of this kind, and not the (conceivably much larger) number of pairs (a, a') G x srf c 
corresponding to these values. 

What makes the sets and srf a convenient for us? The properties imposed in the defini- 
tions of these sets are of two types, anatomical and structural. By anatomical considerations, 
we mean general considerations of multiplicative structure as commonly appear in elementary 
number theory (e.g., consideration of the number and size of prime factors). By structural 
considerations, we mean those depending for their motivation on the fine structure theory 
of totients developed by Ford [For98a]. 

Central to our more anatomical considerations is the notion of a normal prime. Hardy 
and Ramanujan [HR00] showed that almost all natural numbers ^ x have ~ log log a; prime 
factors, and Erdos [Erd35] showed that the same holds for almost all shifted primes p— 1 ^ x. 
Moreover, sieve methods imply that if we list the prime factors of p— 1 on a double-logarithmic 
scale, then these are typically close to uniformly distributed in [0, log log p]. Of course, all of 
this remains true with p + 1 in place of p — 1. We assume that the numbers belonging to 
and s^ a have all their prime factors among this set of normal primes. 

If we assume that numbers n all of whose prime factors are normal generate "most" 
/-values (for / G {(f), cr}), we are led to a series of linear inequalities among the (double- 
logarithmically renormalized) prime factors of n. These inequalities are at the heart of the 
structure theory of totients as developed in [For98a]. As one illustration of the power of 
this approach, mapping the L largest prime factors of n (excluding the largest) to a point 
in IR L , the problem of estimating Vf(x) reduces to the problem of finding the volume of 
a certain region of 1R L , called the fundamental simplex. In broad strokes, this is how one 
establishes Ford's bound (1.2). We incorporate these linear inequalities into our definitions 
of g/ff, and stf a . One particular linear combination of renormalized prime factors appearing 
in the definition of the fundamental simplex is of particular interest to us (see condition (8) 
in the definition of &/f in §3 below); that we can assume this quantity is < 1 is responsible 
for the success of our argument. 

Suppose now that we have a solution to 0(a) = a(a'), where (a, a') G ^ x stf a . We write 
a = P0P1P2 • • ■ and a' = q§q\q2 • • • , where the sequences of Pi and qj are non-decreasing. We 
cut the first of these lists in two places; at the kth prime p k and at the Lth place p L . The 
precise choice of k and L is somewhat technical; one should think of the primes p; larger than 
Pk as the "large" prime divisors of a, those smaller than pk but larger than pi as "small', 
and those smaller than pi as "tiny". The equation 0(a) = fr(a') can be rewritten in the form 

(1.6) (po - l)(pi - l)(p 2 - 1) • • • ipk-i - l)fd = (g + l)(gi + l)(g 2 + 1) • • • + l)e, 
where 



(1.7) / :=0(p fc ---p L _i), d := (p(p L p L+1 - ■ ■), and e := a(q k q k+1 ■ ■ ■ ). 
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To see that (1.6) correctly expresses the relation 0(a) = cr(a'), we recall that the primes 
Pi, . . . , pk are all large, so that by the "anatomical" constraints imposed in the definition of 
srffa each appears to the first power in the prime factorization of a. An analogous statement 
holds for the primes qi,---,qk] this follows from the general principle, established below, 
that Pi ~ qi provided that either side is not too small. There is one respect in which (1.7) 
may not be quite right: Since pi is tiny, we cannot assume a priori that pl ^ Pl-i, and so it 
may be necessary to amend the definition of d somewhat; we ignore this (ultimately minor) 
difficulty for now. 

To complete the argument, we fix d and estimate from above the number of solutions 
(consisting of po, ■ ■ ■ ,Pk-i, qo, ■ ■ ■ , qk-i, e, /) to the relevant equations of the form (1.6); then 
we sum over d. The machinery facilitating these estimates is encoded in Lemma 4.1, which 
is proved by a delicate, iterative sieve argument of a kind first introduced by Maier and 
Pomerance [MP88] and developed in [For98a, §5]. The hypotheses of that lemma include 
several assumptions about the pi and qj, and about e, /, and d. All of these rather technical 
hypotheses are, in our situation, consequences of our definitions of ^ and s^ c \ we say more 
about some of them in a remark following Lemma 4.1. 

Notation. Let P + (n) denote the largest prime factor of n, understood so that P + (l) = 1, 
and let Q(n, U,T) denote the total number of prime factors p of n such that U < p ^ T, 
counted according to multiplicity. Constants implied by the Landau O— and Vinogradov 
<C — and 3> — symbols are absolute unless otherwise specified. Symbols in boldface type 
indicate vector quantities. 



2. Preliminaries 

2.1. Anatomical tools. We begin with two tools from the standard chest. The first is a 
form of the upper bound sieve and the second concerns the distribution of smooth numbers. 

Lemma 2.1 (see, e.g., [HR74, Theorem 4.2]). Suppose ai,...,a,h are positive integers and 
bi, . . . , bh are integers such that 

h 

/ Y\."' If (°A ~ a A) o. 

i=l l^i<j^h 

Then 

#{n < x : a^ + h prime {l^i^ h)} << T^UtT^K << ^f^ 1 + 2)) " , 

(\0gx) h J A (1 _ L)h (\ogx) h 

where v{p) is the number of solutions of the congruence Y\{ an i + h) = (mod p), and the 
implied constant may depend on h. 

Let ^(x, y) denote the number of n ^ x for which P + (n) $C y. The following estimate is 
due to Canfield, Erdos, and Pomerance [CEP83]: 

Lemma 2.2. If2^y^x and u = log xj logy, then 

m(x,y) = x/u u+o{u) 

for u ^ y 1 ^ , as u — y oo . 
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The next lemma supplies an estimate for how often Q(n) is unusually large; this may be 
deduced from the Theorems in Chapter of [HT88]. 

Lemma 2.3. The number of integers n ^ x for which Q(n) ^ a log 2 x is 

' x(\ogx)- Q ( a) ifl<a<2, 
xi\ogx) 1 - alo ^ 2 \og 2 x if a ^2, 

where Q(X) = fflogtdt = Alog(A) - A + 1. 

We also require a simple estimate for the decay of the Poisson distribution near its left 
and right tails. 

Lemma 2.4 (see [For98a, Lemma 2.1]). If z > and < a < 1, then 



Ez" /ey 
M < W 



where Q(-) is defined as in Lemma 2.3. 
Proof. We have 

In the remainder of this section, we give a precise meaning to the term "normal prime" 
alluded to in the introduction and draw out some simple consequences. For S ^ 2, a prime 
p is said to be S -normal if the following two conditions holds for each / G {0, <r}: 

(2.1) Q(f(p),l,S)^2log 2 S, 

and, for every pair of real numbers (U, T) with S ^ U < T ^ /(p), we have 

(2.2) \Q(f(p),U,T) - (log 2 T -log 2 U)\ < v/log 2 51og 2 T. 

This definition is slightly weaker than the corresponding definition on [For98a, p. 13], and so 
the results from that paper remain valid in our context. As a straightforward consequence 
of the definition, if p is S-normal, / G {0, a}, and f(p) ^ S, then 

(2.3) n(/(p))<31og 2 /(p). 

The following lemma is a simple consequence of [For98a, Lemma 2.10] and (1.2): 

Lemma 2.5. For each f G {0, a}, the number of f -values ^ x which have a preimage 
divisible by a prime that is not S-normal is 

«^ / (o;)(log 2 x) 5 (log5)- 1 / 6 . 

We also record the observation that if p is S'-normal, then P + (f(p)) cannot be too much 
smaller than p, on a double-logarithmic scale. 

Lemma 2.6. If 5 ^ p ^ x is an S-normal prime and f{p) ^ S, then 

log 2 P + (f(p)) \og 2 p _ \og 3 x + log 4 
log 2 a; ^ log 2 :r log 2 a; 

Proof. We have P + (f(p)) > /(p) 1 /^/^)) ^ y(p) 3i og2 /( P > ^ piiss^_ The result follows upon 
taking the double logarithm of both sides. □ 
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2.2. Structural tools. In this section, we describe more fully some components of the 
structure theory of totients alluded to in the introduction. Given a natural number n, write 
n = po(n)pi(n)p2(n) • • • , where po(n) ^ Pi(n) ^ p 2 (n) . . . are the primes dividing n (with 
multiplicity). For a fixed x, we put 



Xi(n;x) 



^§g^ if % < Q(n) and Pi (n) > 2, 
if % ^ Q(n) or Pi(n) = 2. 

Suppose L ^ 2 is fixed and that £j ^ for ^ % ^ L — 1. Recall the definition of the Oj from 
(1.3) and let be the set of (xi, . . . , x£) £ K L with ^ x^ ^ x^-i ^ . . . ^ x± ^ 1 and 

(Jo) aiXi + a 2 x 2 H h a L x L < £ , 

(Ji) aix 2 + a 2 x 3 H h a L _iX L ^ ^i^i, 

(Jl-2) ai^L-i + a 2 a; L < £l-2^l-2- 

Define Jl(£) as the volume (L-dimensional Lebesgue measure) of ^l(£)- For convenience, 
let 1 = (1, 1, ... , 1), y L = ^l(I) (the "fundamental simplex"), and let T L be the volume of 
y L . Let 

L (x) := |2C(log 3 x - log 4 x)\ , 

where C is defined as in the introduction. The next lemma, which appears as [For98a, 
Theorem 15], allows us to locate the preimages of almost all /-values within suitable sets of 
the form y L {€)- 

Lemma 2.7. Suppose ^ ^ < L (x), L = L — ^/ , and let 

6 = «<<*> = 1 + m^w (°^< L - 2 '' 

The number of f -values v ^ x with a preimage n for which (xi(n; x), . . . , xl{ji; x)) £ ^l(£) 



is 



< V f (x) exp(-^ 2 /4C). 



For future use, we collect here some further structural lemmas from [For98a]. The next 
result, which follows immediately from (1.2) and [For98a, Lemma 4.2], concerns the size of 
sums of the shape appearing in the definition of inequality (J ) above. 

Lemma 2.8. Suppose L ^ 2, < uj < 1/10, and x is sufficiently large. The number of 
f -values v ^ x with a preimage satisfying 

aiX\{n; x) + • • • + a^x^n; x) ^ 1 + u> 



is 



< V f (x) (log 2 xf (log x) ~ u 2/(150L3 log L) . 



We will make heavy use of the following (purely geometric) statement about the simplices 
it appears as [For98a, Lemma 3.10]. 

Lemma 2.9. //xG <^l(£) an d • ' '££-2 ^ 1-1^ ^en Xj ^ Sg^^Xi when i < j, and 

Xj < 3^ for 1 ^ j ; ^ L. 
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Define M L (£; x) as the set of integers n with Q(n) ^ L and 

(x (n;x),a:i(n;x),. . . , x L -i(n; x)) G <5*z,(£). 

For / e put 

The next lemma, extracted from [For98a, Lemma 3.12], relates the magnitude of R^\^;x) 
to the volume of the fundamental simplex Tl, whenever £ is suitably close to 1. In [For98a], 
it plays a crucial role in the proof of the upper-bound aspect of (1.2). 

Lemma 2.10. // 1/(1000A; 3 ) < u L( ^ k ^ l/(10Ar 3 ) for 1 ^ k ^ L , & = 1 + Wi /or eac/i %, 
and L ^. L , then 

R { l\i;x)^i\og 2 x) L T L 

for both f = <t> and f — a. 

While only the case / = of Lemma 2.10 appears in the statement of [For98a, Lemma 
3.12], the / = a case follows trivially, since a{n) ^ 4>{n). In order to apply Lemma 2.10, 
we need estimates for the volume T L ; this is handled by the next lemma, extracted from 
[For98a, Corollary 3.4]. 

Lemma 2.11. Assume 1 ^ & ^ 1.1 for ^ % ^ L - 2 and that ■ • -££_ 2 x 1- If 

L = L Q - * > 0, £/ien 

(log 2 x) L T L (£) « F(:r)exp(-M> 2 /40). 

i/ere 

(2.4) := exp(0(log 3 x - log 4 x) 2 + D log 3 a; - (D + 1/2 - 2C) log 4 x). 

We conclude this section with the following technical lemma, which will be needed when 
we select the sets ^ and srf in §3. 

Lemma 2.12. For f e {(f), cr} and y ^ 20, 

(2.5) £ 

^-^ w log 3 -u 

p+(i;)<i/ 

where Y is as defined in (2.4). Moreover, for any b > 0, 

'\og 3 x 



(2.6) F(exp((logx) b )) <^ b Y(x) 



log 2 a; 



-2Clogf> 



Proof. We split the left-hand sum in (2.5) according to whether or not v ^ y log2V . The 
contribution of the large v is 0(1) and so is negligible: Indeed, for t > y lo ^y , we have 
> log 2 y. Thus, by Lemma 2.2, we have ^(t,y) <^ t/(logt) 2 (say), and the 0(1) bound 
follows by partial summation. We estimate the sum over small v by ignoring the smoothness 
condition. Put X = y lo ^y . Since Vf(t) x -^Y{t), partial summation gives that 



E 1 



ver f 
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as y — > oo. (The last equality follows, e.g., from L'Hopital's rule.) Since log 2 X/log 3 X ~ 
log 2 !// log 3 y and Y(X) ~ Y(y), we have (2.5). Estimate (2.6) follows from the definition of 
Y and a direct computation; here it is helpful to note that if we redefine X := exp((loga;) 6 ), 
then log 3 X = log 3 x + log b and log 4 X = log 4 x + O b (l/ log 3 x). □ 

3. Definition of the sets ^ and srf a 

We continue fleshing out the introductory sketch, giving precise definitions to the preimage 
sets =2^, and srf a . Put 

(3.1) L:= [L (x)-2^h^\, & := 1 + (1 ^ i< L). 

The next lemma is a final technical preliminary. 

Lemma 3.1. Let f e {0, cr}. T/ie number of f -values v ^ x with a preimage n for which 

(i) (xi(n;a;), . . . ,x L (n;x)) e S"l($), 

(ii) n has fewer than L + 1 odd prime divisors (counted with multiplicity) , 
is <C Vf(x)/ log 2 x. 

Proof. We treat the case when f = <p; the case when f — a requires only small modifications. 
We can assume that x/\ogx ^ n ^ 2rrlog 2 a;, where the last inequality follows from known 
results on the minimal order of the Euler function. By Lemma 2.3, we can also assume that 
Q(n) ^ 10 log 2 x. Put Pi := Pi(n), as defined in §2.2. Since (xi(n; x), . . . , xl(ti] x)) G 
by hypothesis, Lemma 2.9 gives that X2 < 3o 2 < 0.9, and so p 2 ^ exp((loga;) ' 9 ). Thus, 

n/(p Pi) = P2P3 ■ ■ • < exp(10(log 2 x)(loga;) - 9 ) = x o(1) , 

and so po ^ ^ 2 ^ 5 (say) for large x. In particular, we can assume that p\ \ n. 

Suppose now that n has exactly L — k + 1 odd prime factors, where we fix k > L — L. 
Then 

v = (p - l)0(pip 2 • • -PLo-kW 
for some integer s ^ 0. Using the prime number theorem to estimate the number of choices 
for p given p\ ■ ■ -pL -k and 2 s , we obtain that the number of v of this form is 

x 1 x - 1 

(We use here that — ^ Po ^ a: 2 / 5 .) The sum over s is < 1. To handle the 

remaining sum, we observe that pi---pL -k belongs to the set &L -k{£.k-> x )-> where £ k := 
(£oj • • • , £,L -k-2)- Thus, the remaining sum is bounded by 

So by Lemmas 2.10 and 2.11, both of whose hypotheses are straightforward to verify, 

R { £-k(Zk,x) « (hg 2 x) L °- k T Li) „ k *C (\og 2 x) L «- k T Li) _ k (t k ) « Y(x)exp(-k 2 /4C). 

Collecting our estimates, we obtain a bound of 

< ^—Y{x) exp(-k 2 /4C) < V^(rr) exp(-A; 2 /4C). 
log a; 
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Now since L — L > 2^J\og 3 x, summing over k > L — L gives a final bound which is 

< V^x) exp(-(log 3 x)/C) < ^(x)/log 2 x, 



as desired. 

For the rest of this paper, we fix e > and assume that x ^ x (e). Put 



□ 



(3.2) 



S:=exp((log 2 x) 36 ), 5: 



{\og 2 x)-^l\ 



log 2 X ' 



For / G {0, &}, let be the set of n = po(n)pi(n) ■ ■ ■ satisfying f{n) ^ x and 

(0) n ^ xj logo;, 

(1) every squarefull divisor m of n or /(n) satisfies m ^ log 2 x, 

(2) all of the primes Pj(n) are S- normal, 

(3) fi(/(n)) ^ 101og 2 x and H(n) ^ 101og 2 x, 

(4) If d || n and d ^ exp((log 2 x) 1 ' 2 ), then ^ 101og 2 /(d), 

(5) (xi(n; x), . . . , x L (n; x)) G ^l(^), 

(6) n has at least L + l odd prime divisors, 

(7) P + (f(po)) > x 1 ^, pi(n) < x 100l °^*, 

(8) aiXi H h a L Xi ^ 1 - u. 

The following lemma asserts that a generic /-value has all of its preimage in srf$. 

Lemma 3.2. For each f G {(f), a}, the number of f -values ^ x with a preimage n G" srff is 



(i) The &/f not only satisfy Lemma 3.2 but do so economically. In fact, from condition 
(5) and the work of [For98a, §4], we have that <C Vf(x). Thus, on average, 
an element of ~ff{x) has only a bounded number of preimages from g/f. So when 
we turn in §§4- 5 to counting 0- values arising from solutions to 0(a) = cr(a'), with 
(a, a') G x g/ a , we expect not to be (excessively) overcounting. 

(ii) Of the nine conditions defining &/f, conditions (0)-(4) are, in the nomenclature of the 
introduction, purely anatomical, while conditions (5)- (8) depend to some degree on 
the fine structure theory of [For98a]. Condition (8) is particularly critical. It is (8) 
which ensures that the sieve bounds developed in §4 result in a nontrivial estimate 
for V^ j<7 (x). Our inability to replace the exponent \ on log 2 x in Lemma 3.2 (or in 
Theorem 1.1) by a larger number is also rooted in (8). 

Proof. It is clear that the number of values of f(n) corresponding to n failing (0) or (1) is 
<C x log 2 xj logx, which (recalling (1.2)) is permissible for us. By Lemma 2.5 and our choice 
of S, the number of values of f(n) coming from n failing (2) is <C V/(x)/log 2 x. The same 
holds for values coming from n failing (3), by Lemma 2.3. 

Suppose now that n fails condition (4). Then n has a unitary divisor d > exp((log 2 x) 1 / 2 ) 
with Q(f(d)) > 101og 2 /(d). Put w := f(d). Then w | f(n), and f(n) < xlog 2 x. So if 
w ^ x 1 / 2 , then the number of possibilities for f(n) is 



«V>(^)(log 2 a:)- 1/2+£ . 



Remarks. 





1 

- < 



logx 



w 



O(«;)^101og 2 «; 
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using Lemma 2.3 to estimate sum over w. If w ^ x 1 ^ 2 , we observe that f(n)/w = f(n/d) E 
and deduce that the number of corresponding values of f{n) is 

« Y v f (x/ W )^-^-Y(x) y 

^ J logo; ^ w log 2 x 

exp((log 2 x) 1 ' 3 )^^ 1 ' 2 «>>cxp((log 2 x) 1 ' 3 ) 

Q(w)^10 log 2 w S!(iu)^101og 2 w 

By Lemma 2.7, the number of /-values with a preimage failing (5) is <C Vf(x)/ log 2 x. 
According to Lemma 3.1, the number of /-values with a preimage satisfying (5) but not (6) 
is also <C Vf(x)/ \og 2 x. 

Suppose now that n satisfies (0)-(6). From (5), we have l + £o ^ aiX\ + a 2 x 2 ^ (a 1 + a 2 )x 2 , 
and so x 2 ^ 0.8. So from (3), 

(3.3) f = P2(n)p 3 (n) • • • < exp(10(log 2 x)(loga;) a8 ) < rr 1 / 100 . 

In particular, p > x 1 ! 3 + 1 an d /(po) > x 1 ^ 3 , so that t> := f(piP2 • • • ) ^ a^ 2 ^ 3 - The prime po 
satisfies /(po) ^ ^/^ ■ F° r -2 with rr 1 / 3 < z ^ re, the number of primes p with /(po) ^ z an d 
P + (f(Po)) < x 1/log22: is (crudely) bounded by a; 1/loS2 x ) < ^/(logx) 2 , by Lemma 2.2. 
So the number of values of f(n) coming from n with P + (f(p )) ^ 

x l/log 2 x ig 



Ev — -v _ X \ — \ 1 X 
/ 1 < 71 \? > - < 71 75 — • 

2/3 / (logx) 2 Z -- u (loga;) 2 - e 

wsCa; 2 / 3 p: f(pHx/v vs^x 2 ' 3 

ver f p+(f( P ))^x 1 ^ lo s2 x veTf 

To handle the second condition in (7), observe that since f(po) ^ x/v, the prime number 

theorem (and the bound v ^ x 2 ^ 3 ) shows that given v, the number of possibilities for p is 

i 

<C x/(vlogx). Suppose that pi(n) > a; 1001082 * . Then x\ = x\{n\x) ^ 0.999, and we conclude 
from X^oi a i x i ^ Co that either x 2 ^ £ 3 / 2 or rr 3 ^ £> 5 / 2 . Writing v 2 for /(P2P3 • • • ) and f 3 for 
/(P3P4 • ' • )) we see that the number of such /-values is 



x 



log x ^ pi ( ^ v 2 ^ p 2 ^ 

P1 ' \P+(v 2 )ticxp((logx)e- i/2 ) P2 P+(v 3 )^cxp((logx)e 5/2 ) 

« ^- log 3 x C KM f J*^ + (log, X )Y( X) ( ^) 3 ' 2 ) « W 



logo; I \log 2 x/ \log 2 xy J " (log 2 a;) 1 / 2 e ' 

using Lemma 2.12 to estimate the sums over v 2 and t> 3 . 

Finally, we consider n for which (0)-(7) hold but where condition (8) fails. By Lemma 
2.8, we can assume that 

a-iXi H h a L x L < 1 + u, 

since the number of exceptional /-values is <C V(x) exp(— (log 2 x) e l 2 ) <C V(x)/ log 2 x. Thus, 

(3.4) 1 — to < a i x 1 + h a L x L < 1 + to, 

while by condition (J^ in the definition of ^l(£)> 

a x x 2 H h a L ^x L ^ £1X1. 
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We claim that if J is fixed large enough depending on e, then there is some 2 ^ j ^ J with 
Xj ^ Qi~ e / 3 . If not, then for large enough J, 

j-i 

6^1 ^ E a * x »+i ^ Q 1 ~ e/z (aiQ + a 2 g 2 H h aja J ) > g x ~^ 4 . 

3=1 

Thus, X! > g 1 ^ 4 ^ 1 > g 1 ^ 5 , and so 

£o > a~ e/5 (aia + a 2 a 2 + • • • + ajg J ) > g~ e/6 , 

which is false. This proves the claim. We assume below that j e [2, J] is chosen as the 
smallest index with Xj ^ Q^~ e ^\ by condition (1), this implies that all of p±, . . . ,Pj-i appear 
to the first power in the prime factorization of n. 

Now given x 2 , ■ ■ ■ , xl, we have from (3.4) that x\ e [a, a + 2u] for a certain a. Thus, 

— < <*> log 2 x = (log 2 a;) 1/2+e/2 . 
pi ^ 

So the number of /-values that arise from n satisfying (0)-(7) but failing (8) is 

«^E E — - — E - 

j-2 pi,..., P j-i P+(«)^exp((logx)^ 

This completes the proof of Lemma 3.2. □ 

As a corollary of Lemma 3.2, we have that V^ c {x) is bounded, up to an additive error of 
<C (V<f>(x) + V a (x)) / (\og 2 x) 1 / 2 ~ t , by the number of values 0(a) that appear in solutions to 
the equation 

0(a) = a(a'), where (a, a') G ^ x srf a . 

In §§4-5, we develop the machinery required to estimate the number of such values. Ul- 
timately, we find that it is smaller than (V^(x) + V a (x)) /{\og 2 x) A for any fixed A, which 
immediately gives Theorem 1.1. 

4. The fundamental sieve estimate 

Lemma 4.1. Let y be large, k ^ 1, I ^ 0, e e ^ S ^ Vk ^ Vk-i ^ ■ ■ ■ ^ v o = V , an d 
uj ^ vj for ^ j ^ k - 1. Put5= ^/log 2 S/ log 2 y, ^ = log 2 u^-/ log 2 y, ^ = log 2 u-j/ log 2 y. 
Suppose that d is a natural number for which P + (d) ^ v^. Moreover, suppose that both of 
the following hold: 

(a) For 2 ^ j ; ^ k — 1, either (nj,Vj) = (fij-i, ^j-i) or Uj ^ /ij„i — 25. Also, ^ 
/ifc_i - 25. 

(b) For 1 ^ j ^ A; — 2, we have uj > Vj+ 2 . 
The number of solutions of 

(4.1) (p - 1) • • • (Pfc-i - = (g + 1) • • • (<fc-i + l)e ^ y 

in p , • • ■ ,Pfc-i, ?o, • • • , 9fc-i, e, / satisfying 
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IS 



(i) Pi and qi are S-normal primes; 

(ii) m ^ P + ( Pt - 1), P + (q t + l)^Vifor0^i^k-l; 

(iii) neither (fi(Yli=o Pi) nor a (Y[i=o ?») ^ s divisible by r 2 for a prime r ^ v k ; 

(iv) P + (ef)^v k ;Q(f)^4llog 2 v k ; 

( v ) Po — 1 a divisor ^ y 1 / 2 which is composed of primes > v±; 

« V -{c\og 2 yf k {k + l) n W(log^) 8 ( fe+ ^ log ( fc+1 ) +1 (logy)- 2+ ^ la ^ i+E , 

where E = 5 Yli=2 + + 2 XliJiC^i — A 4 *)- -^ ere c ^ s an absolute positive constant. 

Remarks. Since the lemma statement is very complicated, it may be helpful to elaborate on 
how it will be applied in §5 below. Given (a, a') G stf^ x srf a satisfying 0(a) = a (a'), rewrite 
the corresponding equation in the form (1.6), with d, e, and / as in (1.7). (Here L is as in 
(3.1), and k, given more precisely in the next section, satisfies k ~ L/2.) We are concerned 
with counting the number of values 0(a) which arise from such solutions. We partition the 
solutions according to the value of d, which describes the contribution of the "tiny" primes 
to 0(a), and by the rough location of the primes Pi and which we encode in the selection 
of intervals [i/j,i>j] (cf. Lemma 2.6). Finally, we apply Lemma 4.1 and sum over both d and 
the possible selections of intervals; this gives an estimate for the number of 0(a) which is 
smaller than (V^(x) + V a (x)) / (\og 2 x) A , for any fixed A. 

In our application, conditions (i)-(v) of Lemma 4.1 are either immediate from the defini- 
tions, or are readily deduced from the defining properties of srf^ and stf a . Conditions (a) and 
(b) are rooted in the observation that while neighboring primes in the prime factorization 
of a (or a') may be close together (requiring us to allow [u i+ i,v i+ i] = [ui,Vi]), the primes 
Pi(a) and pi + 2{a) are forced to be far apart on a double-logarithmic scale. Indeed, since 
(xi(a;x), . . . ,xl(o>',x)) G Lemma 2.9 shows that x i+2 < 3g 2 Xi < 0.9xj. 

Proof. We consider separately the prime factors of each shifted prime lying in each interval 
(v i+ i, Vi}. For < j < k — 1 and ^ % ^ k, let 

fc-1 k-l 

Sij(n)= Y[ p a , <» = JJ p a , s i = dfY[s i j = eY[s' id . 

P a \\(j>j-1) P a ||fe + 1) 3=0 3=0 

Also, for ^ j ^ k — 1 and 1 ^ % ^ k, let 



U 



s i-l,j ,i _ S i-l,j 



k-l k-l 



t i,j - , ' j > U - y\ u,j - Y\_ ti, 



1,3 i v i,j I ■> "i 11 v h3 11 H,j- 

For each solution = (p , . . .,Pk-i, f,q ,..., q k -i, e) of (4.1), let 

Oi{sf) = {su s it o, s i>fc _i, /; s- , . . . , s-fc-i, e }> 
Ti{&f) = {U, t ij0 , . . . , tj,fc-i, 1; t'i o, t' i k _ 1 , 1}. 

Defining multiplication of (2k + I + 2)-tuples by component-wise multiplication, we have 

(4.2) <Ti_i(j2^) = Oi(srf)Ti(£rf). 
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Let &i denote the set of (Ti(jz/) arising from solutions stf of (4.1) and Xj the corresponding 
set of Ti{s/). By (4.2), the number of solutions of (4.1) satisfying the required conditions is 

(4.3) | 6o | = E E L 

o-i66i neSi 

First, fix <7i G 61. By assumption (v) in the lemma, t lfi ^ y 1 / 2 . Also, ti = t lfi = t' 10 ^ 
y/si, ti is composed of primes > v±, and also si^i + 1 and s' 10 t 1 — 1 are prime. Write 
ti = t[Q, where Q = P + (ti). Since p is an S'-normal prime, (2.3) gives that 

q ^ t V^(*i) ^ ^/nCpo-i) ^ ^i/(2n(po-i)) ^ ^l/ceiogaj/)^ 

Given t[, Lemma 2.1 implies that the number of Q is 0(y(log 2 y) 6 /(s 1 t[ log 3 y)). Moreover, 

t i A A \ p p l I log f 1 

Consequently, for each o\ G ©1, 

(4.4) E 1~ " (l0g2 " )6 



Next, suppose 2 ^ i ^. k. We now apply an iterative procedure: If t>j < t>j_i, we use the 
identity 

< 45 > £ r~r^7 £ r 

If Vi = Vi-i, then (4.5) remains true but contains no information, and in this case we use the 
alternative identity 

< 46 > £ j-= £ 7- £ ^ 

ffi-i66i_i o- i+1 e©i+i n + i& i+1 

We consider first the simpler case when Vi < i>j_i. Suppose <7j G &i, Tj G Xj and <7jTj G 
©i_i. By assumption (ii), 

ti = tifi ■ ■ ■ t^i-i = t i ■ ■ ■ t i i _ 1 . 
In addition, s^-i^-i + 1 = Pi-i and s'^^t'^^ — 1 = are prime. Let Q := P + (tj 5 j_i), 
Q' := P+^J), 6 :'= U^/Q and 6' := i'^/Q'. 

We consider separately X^i, the set of Tj with Q = Q' and X ij2 , the set of Tj with Q 7^ Q' . 
First, 

E,:= V I«V^ ma .xVi 

where denotes the number of solutions of t i0 ' ' •^,?-2^ = ^ = ^0 ' ' '^n-$ ■, an d m the 
sum on Q, Si y i_\bQ + 1 and s^^b'Q — 1 are prime. By Lemma 2.1, the number of Q ^ z is 
<C ^(log z) _3 (log 2 y) 3 uniformly in b,b'. By partial summation, 

E ^«(log 2 y) 3 (logy)- 2 ^\ 
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Also, h(t) is at most the number of dual factorizations of t into % factors each, i.e., h(t) ^ 
j2n(t) _ gy (2.2), < — Ui + 5) \og 2 y =: /. Also, by assumption (iii), t is squarefree. 

Thus, 

(4.7) 



t j^i J 



where 

^2 ~ ^ (^-! ~~ ^) lo S2 1/ + 1 =: H. 

By assumption (a), z/j_i — z/j ^ 25, hence / ^ ^ f^ 2 -^- Applying Lemma 2.4 (with 

a ^ |) to estimate the right-hand side of (4.7), we find that 

(4.8) < ( £ T") / ^ ^ = {\ogyf +llo ^-^ +&) . 
This gives 

Iq < (log^)^^!/)" 2 ^-^^ 106 ^^- 1 "^^. 

For the sum over % ij2 , set U = tQQ'. Note that 

tQ' = tj,o • • • U : i-2b, tQ = t' i0 ■ ■ ■ t't i^b', 

so Q | t' i ■ ■ ■ t' i if _ 2 U and Q' \ t i>G ■ ■ ■ t i:i - 2 b. If we fix the factors divisible by Q and by Q', then 
the number of possible ways to form t is ^ i 2n ^ as before. Then 

1 i2fl(t)+2 j 

E 2 := > — ^ > max > , 

U ^ t b,b> QQ r 

where s i:i _ibQ + 1 and s'^^b'Q' — 1 are prime. By Lemma 2.1, the number of Q ^ z 
(respectively Q' ^ z) is <C z{\ogz)~ 2 (\og 2 y) 2 . Hence, 

En^«( 1 °g2#(logy)^- 1 - 

Q,Q' W 

Combined with (4.8), this gives 

S 2 < i 2 (log 2 y) 4 (logy)- 2w - 1+(i+ilogi)(j/i - 1 - i/<+,5) . 
From (a) and (b), i 2 ^ k 2 ^ (\og 2 y) 2 . Adding S x and £ 2 shows that for each <7j, 

(4.9) ^ - < (logs^^logy)- 2 ^- 1 ^ 108 ^^- 1 -^^. 

We consider now the case when Vi = i>j_i. Set Qi := P + (tj + i 5 j_i), Q2 : = 
Q 3 := -P + (*i + i ji _i), and Q 4 := P + (t- +M ). From (iii), we have that Q x ^ Q 2 and Q 3 ^ Q 4 . 
Moreover, letting bi denote the cofactor of Qi in each case, we have that 

S i+M _i&iQi + 1 = Pi-i, s'i+ij-ihQz - 1 = 

(4.10) s i+1<i b 2 Q 2 + l=Pi, s' i+lti hQ 4 - 1 = ft. 



ON COMMON VALUES OF <f>(ri) AND a(m), II 15 

Since there are now several ways in which the various Qi may coincide, the combinatorics 
is more complicated than in the case when < i>j_i. We index the cases by fixing the 
incidence matrix (Sij) with 5ij = 1 if Qi = Qj and 5ij = otherwise. 

Write D = gcd{Q 1 Q 2 ,Q 3 Q 4 ), and let Q := Q1Q2/D and Q' := Q3Q4/D, so that D,Q, 
and Q' are formally determined by (<%). Then QQ' \ t i+ ±, and writing t i+i /D = tQQ', we 
have 

(4.11) tQ = tj+1,0^+1,1 ' ' ' ^+l,j-2&3^4, 

(4.12) tQ' = t' i+lfi t' i+hl ■ ■ ■ 4hi,i-2&i&2- 

We now choose which terms on the right-hand sides of (4.11) and (4.12) contain the prime 
factors of Q and Q', respectively; since Q(Q) ^ 2 and il(Q') ^ 2, this can be done in at 
most {i + l) 4 ways. Having made this choice, the number of ways to form t is bounded by 
(i + l) m{t \ and so 

(4.13) e ^<E" : + 1)2 "'" +4 — E — • 

*-± t i+1 V t 61,62,63,64 DQQ' 

Ti + lgTi+l t 

It is easy to check that DQQ' = Ylje^fQv where ^ indexes the distinct Qj. For each 
j G ( /, let rij be the number of linear forms appearing in (4.10) involving Qj. Since each of 
these rij linear forms in Qj is prime, as is Qj itself, Lemma 2.1 implies that the number of 
possibilities for Qj ^ z is <C z{\ogz)~ nj ~ 1 {\og 2 y) nj+1 , and so 



E 7T« (log 2 yr +1 (log^-i)- n ' « (log 2j /)^ +1 (lo gJ /)- n ^-i, 
uniformly in the choice of the 6's. Since j % = 4 and Ylijej 1^4, 

< 414 > Eb^< II ( E ^)« (lofeyJ'dogrt-*- . 

The calculation (4.8), with i replaced by i + 1, shows that 

(4.15) ^ - + ^ 2nW ^ (logy)« i+1 ) + ( i+1 ) log ( i+1 »^-^+ 1+ ^. 

Combining (4.13), (4.14), and (4.15) shows that 

(4.16) — < (* + l) 4 (log 2 y) 8 (logy)- 4 ^- 1+ « l+1 ) + ( t+1 ) los ( l+1 »^-^ 1+5 ) 



T i + lfcij + l 
""i+lTj + lGSi-l 



^ (log 2 y) 12 (log|/)~ 2Mi - 1+ ( ilogi+i ^^ _ ^- 1 ^ 2 ^ + ^ i+1 ) lo s( i + 1 )+( i + 1 ))(^+i-^+ <5 ) 

where in the last line we use that t> j_i = Vi and (i + l) 4 ^ A; 4 ^ (log 2 y) 4 . 

Using (4.3), (4.5), and (4.6) together with the inequalities (4.4), (4.9), and (4.16), we find 
that the the number of solutions of (4.1) is 

< y( c log 2 i/) 6fe (logt/)~ 2 ~ iyi+E -2^- 1 -^ +<5 ^ ilogi+i )- 2 ^- 1 V — , 
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where c is some positive constant. Note that the exponent of logy is ^ — 2 + YliZi a i v i + E. 

It remains to treat the sum on o~k- Given s' k = Sk/d, the number of possible Ok is at most 
the number of factorizations of s k into k + 1 factors times the number of factorizations of 
ds' k into k + 1 factors, which is at most (k + l) n ( ds 'k\k + l) n K). By assumptions (i) and (iv), 
n(s' k ) ^ 4(k + l)\og 2 v k . Thus, 

1 (k+ lfW(k+ l)8(fc+01°g 2 «* 1 (k + l)^ d \logV k ) S{k+l)los{k+1)+1 

^ 7 k ^ d ^ 7 k <<c 1 • D 

5. Counting common values: Application of Lemma 4.1 

In this section we prove the following proposition, which combined with Lemma 3.2 im- 
mediately yields Theorem 1.1. Throughout the rest of this paper, we adopt the definitions 
of L, the £j, S, 5, and u from (3.1) and (3.2). 

Proposition 5.1. Fix A > 0. For large x, the number of distinct values of (ft (a) that arise 
from solutions to the equation 

0(a) = cr(a'), with (a, a) G stf^ x g/ a , 

is smaller than (V^x) + V a (x)) / (log 2 x) A . 

Let us once again recall the strategy outlined in the introduction and in the remarks 
following Lemma 4.1. Let (a, a') G =2^ x srf c be a solution to 0(a) = a (a!). Let pi := Pi(a) 
and a; := Pi(af), in the notation of §2.2. We choose a cutoff /c so that all of po, . . . ,Pk-i and 
<7o, • • • , <?fc-i are "large". Then by condition (1) in the definition of the sets srff, neither p\ \ a 
nor qf \ a', for ^ i ^ k — 1. Fixing a notion of "small" and "tiny", we rewrite the equation 
0(a) = cr(a') in the form 

(5.1) (po - 1) • • • (Pfc-i - = (g + 1) • • • + l)e, 

where / is the contribution to 0(a) from the "small" primes, d is the contribution from the 
"tiny" primes, and e is the contribution of both the "small" and "tiny" primes to a (a'). 

We then fix d and numbers Ui and Vi, chosen so that Ui ^ P + (pi — 1), P + (qi + 1) ^ f» for 
each ^ i ^ fc — 1. With these fixed, Lemma 4.1 provides us with an upper bound on the 
number of corresponding solutions to (5.1). Such a solution determines the common value 
0(a) = cx(a') G fl We complete the proof of Proposition 5.1 by summing the upper 
bound estimates over all choices of d and all selections of the and i>j. 

We carry out this plan in four stages, each of which is treated in more detail below: 

• Finalize the notions of "small" and "tiny", and so also the choices of d, e, and /. 

• Describe how to choose the Ui and Vi so that the intervals [m*, i>j] capture P + {j>i — 1) 
and P + {qi + 1) for all ^ % ^ k - 1. 

• Check that the hypotheses of Lemma 4.1 are satisfied. 

• Take the estimate of Lemma 4.1 and sum over d and the choices of Ui and V{. 

5.1. "Small" and "tiny". Suppose we are given a solution (a, a') G sf^ x srf c to 0(a) = 
cr(a'). Set = Xj(a; x) and ?/j = Xj(a'; x), in the notation of §2.2, so that (from the definition 
of srff) the sequences x = (xi, . . . ,x L ) and y = (yi, . . . , y L ) belong to ^l(C)- 

Lemma 5.2. {zj}j =1 denoting either of the sequences {xj} or {yj}, we have 

(i) Z j < 3^ for 1 < j < L, 
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(ii) z L -j ^ i§oe"V lc, g2 xfor0^j<L. 

(iii) z i+2 < 0.9^ /or 1 < j < L - 2. 

Proof. Claim (i) is repeated verbatim from Lemma 2.9. By the same lemma, Zj ^ 3^ _i Zj 
for 1 ^ % < j ^ L. This immediately implies (iii), since g 2 < 0.9. Moreover, fixing j = L, 
condition (6) in the definition of srff gives that 

Zi > \t L ZL > X ^^Q'- L l\o^X > JLg^ L /hg 2 X, 

which is (ii) up to a change of variables. □ 
Lemma 5.3. The minimal index ko ^ L for which 

log 2 P + (^ () -l)<(log 2 x) 1 / 2+e / 10 . 
satisfies k ~ (1/2 — e/10)L as x — > oo. 

Proof. Lemma 5.2(i) shows that the least K with \og 2 pK < (log 2 a;) 1 / 2 + e / 10 satisfies K ^ 
(1/2 — e/10 + o(l))L, as x — > oo. Since \og 2 P + (p K — 1) ^ log 2 px, this gives the asserted 
upper bound on k . The lower bound follows in a similar fashion from Lemma 5.2 (ii) and 
Lemma 2.6. □ 

Recall the definition of 8 from (3.2), and put 

(5.2) r] := 10L6, so that 77 x (log 3 x) 3/2 (log 2 x)~ 1/2 . 

We choose our "large" / "small" cutoff point k by taking k — k if x feo _i — x fco ^ 20?7, and 
taking = k — 1 otherwise. For future use, we note that with this choice of k, 

(5.3) x fc _i - x k > 2O77. 

This inequality is immediate if k = k ; in the opposite case, by Lemma 5.2(iii), 

X k -i ~X k = X ko -2 ~ Xk -l > Xko-2 ~ Xk - 2077 

^ 0.1x feo _ 2 - 2O77 ^ 0.1(log 2 a;)- 1/2+e/10 - 2O77 > 2O77. 

Note that with this choice of k, we have log 2 pi > (log 2 x) l l 2+e l lQ for ^ % ^ A; — 1, and so 
condition (1) in the definition of guarantees that each divides a to the first power only, 
for ^ % ^ k — 1. Moreover, from Lemmas 5.2(h) and 5.3, we have log 2 qi > (\og 2 x) 1 ^ 2+€ ^ u 
for ^ % ^ k — 1. So each gj divides a' only to the first power, for ^ % ^ k — 1. Now take 



(5.4) / := (f)(p k p k+ i ■ ■ -Pl-i), d:= 

and 



<P(,PlPl+i ■■■ ) if Pl-i 7^ Pl, 

<P{PlPl+i ■■■) if Pl-i = Pl, 



e := a(q k q k+ i ■■■), 
and observe that equation (5.1) holds. 
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5.2. Selection of the Uj and Vj. Rather than choose the Uj and Vj directly, it is more 
convenient to work with the \ij and Vj\ then Uj and Vj are defined by Uj := exp((logx) Mi ) 
and Vj := exp^logx) 1 ^). Put 

trr\ si log 3 a; + log 100 A /- » ■ 

(5.5) Co := 1 j , and Q ■= Co ~ JV 0^1), 

iog 2 x 

and note that with v :— 1 and /xq := Co, we have 



,1/(100 log 2 x) 



Mo = x -/v < x l / Xo ^ x sC p+( Po - l),P + (q + 1) <C x = v , 

by condition (7) in the definitions of ^ and £/ a . To choose the remaining /ij and Vj, it 
is helpful to know that pj and g-,- are close together (renormalized on a double logarithmic 
scale) for 1 ^ j ^ k. This is the substance of the following lemma. 

Lemma 5.4. If pj > S and qj ^ S, then 

(5.6) \x j -y j \^(2j + l)5<r l . 

These hypotheses hold if L — j ^ 2Clog 4 x + 12, and so in particular for 1 $C j ^ k. 

Proof. Suppose for the sake of contradiction that yj ^ Xj + (2j + 1)5; since the Pi and % are 
all S'-normal, this would imply that 

1U 62 x 1U 62 x 

which is false. We obtain a similar contradiction if we suppose that Xj ^ yj + (2j + 1)5. The 
second half of the lemma follows from Lemma 5.2 and a short calculation, together with the 
estimate k ~ (1/2 — e/10)L of Lemma 5.3. □ 

We choose the intervals [fij,Uj] for 1 $C j ^ k — 1 successively, starting with j — 1. 
(We select last, by a different method.) Say that the pair {xj,Xj + ±} is well- separated if 
— Xj+i ^ IO77, and poorly separated otherwise. 

In the well-separated case, among all Q (with i ^ 0), choose £ minimal and £' maximal 
with 

C'log 2 x < log 2 min{P + fe - l),P + fe + 1)} 

< log 2 max{P + ( Pj - l),P+(g, + 1)} ^ Clog 2 a;, 

and put 

ft := C, Vj := ('■ 

In the poorly-separated case, j < k — 1, by (5.3). We select [/ij, Uj] = [//j+i, 1^+1] by a similar 
recipe: Among all Q (with % ^ 0), choose £ minimal and £' maximal with 

C'log.x < log 2 min{P + fe - l),P+fe + l),P + fe+i - l),P + (q J+ i + 1)} 

< log 2 max{P + ( Pj - l),P + fe + l),P + fe + i - l)>^ + fe+i + 1)} < Clog 2 x, 

and put 

V 3 = u i+i = C> and A*J = A«j+1 = C- 
To see that these choices are well-defined, note that by (7) in the definition of <g/f, we have 
Xj,yj ^ Co, which implies that a suitable choice of ( above exists in both cases. Also, for 
1 < % ^ k, we have x i: yi ^ (log 2 a;) _1 / 2+e / 11 (by Lemma 5.3 and 5.2(h)). So by Lemma 2.6, 

log 2 min{P + ( ft - l),P + (q t + l)}/log 2 a; > (hg.x)- 1 / 2 ^ 12 , 
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say. Since neighboring Q are spaced at a distance 77 x (log 2 x)~ 1/,2 (log 3 x) 3//2 , a suitable 
choice of (' also exists in both cases. 

For our application of Lemma 4.1, it is expedient to keep track at each step of the length 
of the intervals [fij, Vj], as well as the distance between the left-endpoint of the last interval 
chosen and the right-endpoint of the succeeding interval (if any). In the well-separated case, 
Lemmas 5.4 and 2.6 show that 

Vj < max{xj, Hj} + r] < Xj + 2rj, 

while 

log 3 x + log4 

Hj > mm{^, % } — — 77 

iog 2 x 

(5.7) ^ - 3?7, 

so that 

Vj — /ij ^ 5i]. 

Also, if a succeeding interval exists (so that j + 1 ^ k — 1), then 



^ max{x i+1 , + 77 ^ x i+ i + 2rj, 



and the separation between \ij and z/ J+ i satisfies the lower bound 

(5.8) fij — Vj + i ^ Xj — Xj + i — 5r] ^ 577. 
In the poorly separated case, we have 

Vj ^ max{xj,yj, x j+1 , y j+1 } + 77 = max{i J , yj) +rj ^Xj + 2r], 
as before, but the lower bound on fij takes a slightly different form; 

log 3 x + log4 
fij ^ mm{xj, Vj , x j+1 ,y j+1 } r—— 77 

(5.9) ^ (x j+1 -77) j—— 77 ^ Xj+i - 3?7 ^ - 13?7, 

so that 

Z/j — /ij ^ 15?7. 

In this case, since Vj = Vj +i and fij = fij+i, the succeeding interval (if it exists) is [fij+2, ^+2]- 
By Lemma 5.2(iii), 

Xj - Xj+2 > O.lxj ^ 0.1(log 2 x)- 1/2+e/10 > 2O77, 

say. Thus, 

v j+2 ^ max{x j+2 , 7/ i+2 } + 77 ^ x j+2 + 277 < a^- - I877, 

and so 

(5.10) flj +1 - V j+2 = fij - V j+2 > (Xj - 137/) - (Xj - 187/) > 577. 

At this point we have selected intervals [fij, Vj], for all ^ j ^ k — 1. We choose v k = (, 
where £ is the minimal ^ satisfying ( ^ x fc + 77. Note that 

log 2 5/ log 2 x = 36 log 3 x/ log 2 x < (log 2 x)~ 1/2+e/n ^ x fe < C = v k < £fc + 277. 

Thus, > 5. From (5.7) and (5.9), 

fl k -l ^ x fc _i - 37/, 
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so that also 

(5.11) - v k ^ x k _ x - x k - 5r] ^ 1577, 
where the last estimate uses (5.3). 

5.3. Verification of hypotheses. We now check that Lemma 4.1 may be applied with 
y = x. By construction, S ^ v k ^ v k -\ ^ • • • ^ Vq — x, and Ui ^ t> « for all ^ i ^ k — 1. 
Moreover, if [/ij, Uj] 7^ [pij-i, Vj-i] (where 2 ^ j ^ k — 1), then from (5.8) and (5.10), 

— z/j ^ 5i] = 50L5 > 25, 

and from (5.11), 

A*fe-i -Vk> 1577 > 25. 

Thus, condition (a) of Lemma 4.1 is satisfied. It follows from our method of selecting the fj,j 
and Vj that if Vj = Vj+i, then (again by (5.10)) Vj+2 ^ ^j+i — 5i] < = Uj, which shows 
that condition (b) is also satisfied. Moreover, since v k > x k , we have P + (d) ^ pi ^ p k < v k . 
So we may focus our attention on hypotheses (i)-(v) of Lemma 4.1. We claim that these 
hypotheses are satisfied with our choices of d, e, and / from §5.1 and with 

(5.12) l:=L-k. 

Property (i) is contained in (2) from the definition of g/f. By construction, 

«i< P + ( Pt -l),P + {qi + l) 

for all ^ % ^ k — 1, which is (ii). Since Vk ^ S > logy, property (iii) holds by (1) in the 
definition of &/f. The verification of (iv) is somewhat more intricate. Recalling that v k > x k , 
it is clear from (5.4) that 

P + (f)<Pk^v k . 

To prove the same estimate for P + {e), we can assume e ^ 1. Let r = P + {e), and observe 
that r I <j(R), for some prime power R with R \\ qkqk+i • • • ■ If -R is a proper prime power, 
then from (1) in the definition of £/f, we have r ^ cr(R) ^ 2R ^ 2(loga;) 2 < v k . So we can 
assume that R is prime, and so R ^ q k and 

r ^ P + (R + 1) ^ max{3, R} ^ q k . 

But by Lemma 5.4, 

log 2 q k / log 2 x = y k ^ x k + (2k + 1)5 < x k + r] ^ v k . 

Thus, P + (e) — r ^ v k . Hence, P + (ef) ^ v k . Turning to the second half of (iv), write 
Pk ■ ■ -Ph-i = AB, where A is squarefree, B is squarefull and gcd(A, B) = 1. Recalling (2.3), 
we see that 

n{<j>(A)) ^ 3Q(A)\og 2 v k ^ 3l\og 2 v k , 

with I as in (5.12). Let B' be the largest divisor of a supported on the primes dividing B, 
so that B' is squarefull and B \ B' . By (1) in the definition of srff, we have B' ^ (log a;) 2 . If 
B' ^ exp((log 2 x) l l 2 ), then (estimating crudely) 

£l(<t>(B)) ^ n(cf>(B')) ^ 2 logos') < 2 log B' ^ 2(log 2 :r) 1/2 . 

On the other hand, if B' > exp((log 2 x) 1 ^ 2 ), then by (4) in the definition of srff, 

n((j)(B)) < tt((f)(B')) <: lOlog 2 0(/3') < 101og 2J B' < log 3 a;. 
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Since log 2 t> fc = u k \og 2 x > r]\og 2 x > (log 2 x) 1//2 , we have we have Q(<f>(B)) ^ 2\og 2 v k in 
either case. Hence, 

Q(f ) = n((j>(A)) + n(<f>(B)) ^ (31 + 2) log 2 v k < 41 log 2 Ufc , 

which completes the proof of (iv). Finally, we prove (v): Suppose that b ^ x 1 ^ 3 is a divisor 
of po — 1. Recalling again (2.3), 

P + (&) ^ &V«(P0-1) ^ &3l3i^ ^ x 9lSi^ > ^lOOlog-^ ^ ^ 

Thus, setting b to be the largest divisor of p — 1 supported on the primes ^ vi, we have 
6 < x 1 / 3 . From (3.3) and conditions (0) and (7) in the definition of 

a rr/logrr 095 

Po = > /100 > a; ' 95 , 

say. Thus, (po — l)/& is a divisor of £>o _ 1 composed of primes > V\ and of size at least 
(p - l)x" 1/3 > a; 9 / 10 ^ 1 / 3 > x 1 / 2 . 

5.4. Denouement. We are now in a position to establish Proposition 5.1 and so also The- 
orem 1.1. Suppose that k and the fa and i/j are fixed, as is d; this also fixes I = L — k. By 
Lemma 4.1, whose hypotheses were verified above, the number of values 0(a) coming from 
corresponding solutions to 0(a) = cr(a'), with (a, a') G =2^ x £/ a , is 

(5.13) « ^(clog 2 x) 6fe (A; + l) n W(log Ufc ) 8 ( fc+I ) to «( fc+1 ) +1 (loga;)- 2+ ^=i 0i, ' i+s 

< ^ exp (O ( (log, x) 2 ) ) (log v k ) L \\ogx) - 2 +^ a ^ +E ' , 
a 

where 

fe-i 

= £ + ^2a i (u i -x i ) 
i=i 

fe fe-i fe-i 

= 5 (i log? + i) + 2 - fa) + ^ ai(z/j - x*). 

i=2 i=l i=l 

By our choice of ^ and /ij in §5.2, we have Vi — /ij <C 77 and z/j — Xj <C 77. Hence, 

£" < 0X 2 log L + 77 {^L + a^ < 5L 2 log L + 77Z 2 log L < 0X 3 log L. 

In combination with (7) from the definition of stf^, this shows that the exponent of logx on 
the right-hand side of (5.13) is at most — 1 — 00 + E' < — 1 — lu/2, and so 

(\ogx)- 2+ ^ aiXi+E ' ^ (log x)" 1 exp (-^(logzx) 1 ' 2 ^' 2 ^ . 

Moreover, by Lemma 5.3 and Lemma 5.2(i), 

(5.14) v k < x k + 2rj^ (log 2 x)- 1 ^ 9 + 2rj^ (log 2 x)- 1 ^ 5 , 
and hence 

(log^ = exp(L 2 (log 2 :r)z/ fc ) < exp((log 2 xf' 2+ ^). 
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Inserting all of this back into (5.13), we obtain an upper bound which is 
(5,5) «_£_ exp (_l (log2l) V 2+e /^. 

Now we sum over the parameters previously held fixed. We have k < L; also, for i > 0, 
each ^ and z/j has the form Q of (5.5). Thus, the the number of possibilities for k and the 
Hi and z/j is 

(5.16) < L(l + y r l \f L <: exp(0((log 3 :r) 2 )). 
Next, we prove that 

(5.17) Vt{d) < (\og 2 x) 1 ' 2 
uniformly for the d under consideration, so that 

(5.18) L nid) ^ exp(0((log 2 :r) 1/2 log 4 :r)). 

Put m := PlPl+i • • • ■ Suppose first that Pl ^ Pl-i, so that m is a unitary divisor of a and 
d = 4>{m). If m ^ exp((log 2 x) 1 / 2 ), then (5.17) follows from the crude bound il(d) <C logrf. 
On the other hand, if m > exp((log 2 x) 1 ^ 2 ), then from (4) in the definition of srf^ we have 
£l(d) = f2(0(m)) <C log 2 m. But by (3) in the definition of and Lemma 5.2(i), 

log 2 m < log 2 pJ° log2 x < log 3 x + log 2 p L < log 3 x + g L log 2 x 



< log 3 x + q- 2 V^q l ° l og2 x < fT 2 v 71 ^ log 3 x < exp (O ( v^og^) ) , 

which again gives (5.17). Suppose now that pl = Pl-i- In this case, let m' be the largest 
divisor of a supported on the primes dividing m. Then d \ <f)(m'), and so fl(d) ^ Q(<j)(m')). 
Write m! = p-^m", where j ^ 2 and pi \ m"; both p 3 L and m" are unitary divisors of a. 
We have f2(0(m")) <C (log 2 a;) 1 / 2 , by mimicking the argument used for m in the case when 
Pl 7^ Pl-i- Also, f2(0(p^)) <C (log 2 x) 1//2 except possibly if p ] L > exp((log 2 x) 1 ^ 2 ), in which 
case, invoking (1) and (4) in the definition of =2^,, 

lOlog 2 0(p J L ) < 101og 2 pi ^ 101og 2 (log 2 :r) < log 3 x. 

So 

n(d) ^ fi(0(m')) = n(<Kp£)) + ^(^K')) « (log.x) 1 / 2 , 

confirming (5.17). 

Referring back to (5.15), we see that it remains to only to estimate the sum of 1/d. Since 
P + (d) ^ v k , (5.14) shows that every prime dividing d belongs to the set 0* := {p : log 2 p ^ 
{\og 2 x) l / 2+e ' 5 }. Thus, 

( 5 - 19 ) E^n( 1 + ^ + i+") << ex P(( lo g2-) 1/2W5 ). 

pg£ ja \ P P J 

Combining the estimates (5.15), (5.16), (5.18), and (5.19), we find that 

#{0(a) -.ae^a'e <,0(a) = a(a')} < ^exp (^(log 2 :r) 1/2+e/2 ^ , 
which completes the proof of Proposition 5.1 and of Theorem 1.1. 
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